````````````## Where do redditors want to go? Analysis of posts from [IWantOut](https://www.reddit.com/r/IWantOut/), a reddit forum containing posts of people who would like to move to another country. Almost all posts in the reddit forum contain the origin country of the redditor and his/her dream destination country/countries. Below is the Sankey Diagram showing the origins and desired destinations of redditors. A wider light puple line indicates a higher number of posts. You can view an expanded version of the Sankey diagram [here](https://rouenlee29.github.io/reddit-iwantout/iwantout_sankey.html).
Most redditors come from North America (not surprising, reddit is most popular in the US). Europe is the most desired destination (should I be surpised?). ## Who are the redditors
## Analysis of posts I looked at the posts of redditors by destination region ### Words with high Tf-idf scores, by destination region 1 - One _document_ is a collection of ALL posts with the same destination region. For example, all posts with destination as Asia will be collated into one document. - Every word in a document will be assigned a _Tf-idf score_. - A word in a document has a high Tf-idf score if it appears almost exclusively in the document and appears very frequently in the document. [Read more](http://www.tfidf.com/). - The bigger the word in the diagrams below, the higher its Tf-idf score. Europe | Africa :-------------------------:|:-------------------------: ![](images/europe.png) | ![](images/africa.png) Oceania | Asia :-------------------------:|:-------------------------: ![](images/oceania.png) | ![](images/asia.png) Europeaan Union2 | North America :-------------------------:|:-------------------------: ![](images/europeanunion.png) | ![](images/northamerica.png) Observations - The word "work" appears very frequently across all posts. - Seems like "visa" is a concern to those wishing to go to Western countries. - Those looking to move to the Euopean Union seem aware of the potential need to learn a new language ## Analysis of jobs ### Words with high counts, by destination region Some redditors reveal their jobs in the post titles. I have extracted this information and performed a word count by destination region. Seems like North America and Europe are attracting people with different skillsets: Europe | North America :-------------------------:|:-------------------------: ![](images/jobs_europe.png) | ![](images/jobs_northamerica.png) ## Notes 1. What do you mean by _region_? A region is more than a country, but less than or equal to a continent. 2. What is the difference between _Europe_ and _European Union_? - If a redditor writes a specific destination country (for example, United Kingdom, Netherlands or Germany) in Europe, the destination region will be _Europe_. - If a redditor writes "EU" as his/her destination, then the destination region will be _European Union_. Code to scrape, transform and create the charts can be found in [my github repo](https://github.com/rouenlee29/reddit-iwantout).